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Abstract 

In this paper we complete and extend our previous work on stochastic control applied to high 
frequency market-making with inventory constraints and directional bets. Our new model admits 
several state variables (e.g. market spread, stochastic volatility and intensities of market orders) 
provided the full system is Markov. The solution of the corresponding HJB equation is exact in the 
case of zero inventory risk. The inventory risk enters into play in two ways: a path-dependent penalty 
based on the volatility and a penalty at expiry based on the market spread. We perform perturbation 
methods on the inventory risk parameter and obtain explicitly the solution and its controls up to first 
order. We also include transaction costs; we show that the spread of the market-maker is widened 
to compensate the transaction costs, but the expected gain per traded spread remains constant. We 
perform several numerical simulations to assess the effect of the parameters on the PNL, showing 
in particular how the directional bet and the inventory risk change the shape of the PNL density. 
Finally, we extend our results to the case of multi-aset market-making strategies; we show that the 
correct notion of inventory risk is the L2-norm of the (multi-dimensional) inventory with respect to 
the inventory penalties. 
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1 Introduction 

1.1 Variables 

We will work with time t € [0, T] and two controls, the half ask (resp. bid) spread of the market-maker 
5 + (resp. (5~), measured as the distance between the mid-price and her ask quote (resp. bid quote). Our 
model admits several state variables, which can be any process provided the whole system is Markov. This 
framework admits a large class of price models e.g. jump processes with stochastic volatility. Without 
loss of generality, we will restrict our analysis to the folowing ones: 

1. The mid-price S(t), assumed to be an Ito diffusion. 

2. The half market spread Z{t) 1 which is assumed non-negative and somewhat mean-reverting. 



3. The volatility £(£) of the mid-price process, which is strictly positive. 
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Figure 1: Market order intensities X ± . They are extrapolated when 6 < — z (dotted lines). 

4. The inventory Q(t), which is modelled as 

dQ(t) = dN-(t)-dN+{t), 
where dN + (t) and dN~(t) are two independent Poisson processes. 

5. The intensities A* of the previous Poisson processes are (i) exponentially decreasing in the distance 
to the quote on the other side, and (ii) their speed of decay K is random: 

\ ± {5 ± ) = Ae- K ^ +5 ^ 

6. The cash X(t), which is simply the money earned by the market-making by selling and buying the 
stock, i.e. 

dX{t) = [S(t) + 5+}dN+(t) - [S(t) - S^]dN-(t) . 

To keep things simple, we will use the notation 

Y(t) = (S(i),Z(t),E(t),K(t)) and y = (s, z, a, k) . 

The approach we propose is very general because it admits any number of state variables. In that spirit, 
one could add to Y(t) other state variables, e.g. a random intensity A{t) of the arrival of market orders 
or a statistical indicator coming from an alternative market. 

1.2 Hypotheses on the processes and controls 

We assume our controls 8 ± lie in an admissible space A. In order to have a well-posed stochastic control 
problem, we have to assume that the the system of all state variables is Markov, since otherwise our 
mathematical techniques do no apply. It is worth to mention that this assumption does not necessarily 
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imply having Markovian variables: as in the case of stochastic volatility, it is not the mid-price which 
needs to be Markov but the couple (mid-price,volatility). 

Another hypothesis we need is that all value functions u are finite. In the case of the value function 
([6]), we accept any mid-price process S(t) such that the function u in ([9]) is finite. This condition holds 
for any process such that its conditional expectation ^L t ,y[S{T)\ is affine in s = S(t). For example, a 
Brownian motion and an Ornstein-Uhlenbeck are acceptable mid-price processes. 

1.3 General HJB and optimal controls 

Let us start with the general HJB equation, i.e. 

(d t + C)u+ max Ae~ k[z+s+] \u(t, y,q-l,x+(s + 5+)) - u(t, y, q, x)] (1) 

+ max Ae~ k[z+S ~ ] [u(t, y,q + l,x-(s- <T)) - V, 9> x )] = -/(*, V, 9> x ) , 

u(T,y,q,x) = g(y,q,x)), 

where C is the infinitesimal generator for the state variables y. The steps to solve ([T]) are as follows: 

1. Based on the utility function we make an ansatz, i.e. we guess the general form of the solution of 
0, i.e. 

u(t,y, q,x) = x + v g (t, s) . (2) 

2. We substitute the ansatz ^ in ([I]) in order to find an easier HJB equation for v. We use this 
new HJB equation to find the optimal controls that maximize the jumps. Indeed, after elementary 
calculus, where the jump part of ([!]) is considered a function of (i5 + , i5 _ ), one finds that the optimal 
controls are 

St = £ -s + Vq(t,y)-v q -i(t,y), S~ = i + s + v q (t,y) - v q+ i(t,y). (3) 

3. We substitute the optimal controls in the HJB equation: the resulting equation is called the verifi- 
cation equation. In our case it is 

(d t + C)v q + ^(e- k ^ +S ^+e- k ^ +& ^) +f(t,y,q,x)) = 0, (4) 

v q{T,y) = g(y,Q,x), 

which is highly nonlinear because Sf 1 = 6jjr(v q ). 

4. We cannot solve directly the verification equation Q via the Feynmann-Kac representation formula 
because the former is nonlinear and the latter works only for linear equations. However, the idea 
is to decompose our nonlinear problem into several linear problems, and apply Feynmann-Kac to 
each one of them. Recall that for a linear equation of the form 



(d t + £)w(t,y,q,x)+h(t,y,q,x) = 0, 

w(T,y,q,x) = g(y,q,x), 
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the (unique) solution is given by the Feynmann-Kac formula (see e.g. Pham [TP] ) 



w(t,y,q,x) = E ; 



't,y,q,x 



g(Y(T),Q(T),X(T)) + 




(5) 



where ^t,y,q,x is the conditional expectation given Y(t—) = y, Q(t—) = q and X(t— ) = £. As it will 
be evident throughout this work, our approach relies entirely on the Feynmann-Kac formula 
Therefore, as long as our general Markov processes are such that the expectation in ([5| is finite, we 
are on solid ground. 

5. Alternatively, we could express the optimal quotes in terms of the market-maker's bid-ask spread 



Notice that 5+ = 5^ if and only if r*(t) = s = S(t). Therefore, — s measures the level of 
asymmetry of the quotes with respect to the mid-price s. 

There is an important remark on the shape of the optimal controls (|3j). If the only state variable 
is the mid-price then v q {t,y) = v q {t,s). Moreover, if the mid-price is a martingale then we can assume 
that v q = v q (t) because, by definition, there is no directional bet on the price. Under these assumptions, 
plugging the explicit optimal controls Q into the verification equation Q leads to a system of ODEs for 
v q , indexed by q. This system can be solved numerically, but is is nearly impossible to have an explicit 
formula (see e.g. Cartea-Jaimungal |4] and Gueant-Lehalle- Fernandez [7]). In our case, with a general 
price process and several other state variables, there is no hope for an system of ODEs; in fact, the 
corresponding system is PDE-based. Therefore, either we solve the problem explicitly (as we will do in 
the linear case), either we perform some asymptotics for the optimal controls (as we will do in the case 
of inventory penalty). 

2 Linear utility function 

2.1 Hamilton-Jacobi-Bellman (HJB) equation 

Here we will try to find the optimal controls S ± that maximise the PNL profit and loss) of a market-maker, 
i.e. a linear the value function, i.e. 



V>* := 6+ + s; 



and the centre of her spread 




u(t,y,q,x) = max E, 



8±eA 



[a(Y(T),Q(T),X(T))] 



(6) 



where E, 



't,y,q,x 



is the conditional expectation given the values of all variables at time i, i.e. 



E t , y ,q, x [a] := E [a\Y(t-) = y, Q(t-) = q, X(t-) = x] 



G 
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The probabilistic representation ^ is the unique solution of the HJB equation 

(d t +C)u+ max Ae~ k[s++z] \u(t, y,q-l,x+(s + 5+)) - u(t, y, q. x)] (7) 

S+GA 

+ max Ae~ k[s ~ +z] \u(t, y 7 q + l,z, k.x - (s - S~)) - u(t,y, q. x)] = 0, 
s-eA 

u(T,y,q,x) = a(s,q,x), 

where L is the infinitesimal generator for all the continuous state variables y. Therefore, we will use ^ 
to find the optimal controls 6 . 

Our linear utility function here is simply the PNL of the market maker, i.e. 

a(t, y, q, x) = x + qs . 

Its corresponding value function is thus 

u(t, y, q, x) = max E ttVtq<x [X{T) + Q{T)S{T)] . 

In this case, the optimal controls are the bid and ask quotes that maximises the PNL of the market-maker 
throughout the trading day. 

2.2 Ansatz and HJB 

We will look for a solution of the form 

u(t,y,q,x) = x + 9 (t,y) + q9i(t,y) . 
With this ansatz, the HJB equation takes the form 

(d t + C)(0 O +q0i)+ max A e - k[z+s+] [s + 5+ - 6> x ] + max A e -^ z+s '^ [s + 5+ + 6 X ] = , 
s+eA ' s-eA 

0o(T,y) = 0, 
9 x {T,y) = s. 

2.3 Computing the optimal controls 

Define 

f + (5+) := Ae-^ z+s ^ [s + 5+ - X ] . 
Using elementary Calculus we find that its maximum is attained at 

k 

Analogously, for 

f~(S~) := Ae-^ z+s ^[-s + 5- + 6{\ . 

the maximum is attained at 

6Z = \ + a-9 1 . 
k 
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With the optimal half-spreads 



we can easily compute the optimal spread ip* for the market maker, along with the centre r* of her spread: 

^. := 6+ + 6- = j , r*:=s + l (S+ - S~) = 6 l . 
k l 

Notice that 0q is necessary only for the solution of the HJB equation, not for the controls. Indeed, we 
only need to find 9\ in order to have our controls explicit. 

2.4 Solving the verification equation 

With the optimal controls, the HJB equation reduces to 



From the explicit form of 6 we have 

/+(#) := 4e-*"e-*[*-] , /" 
ek 

Therefore, the verification equation can be rewritten as 

2 A 

(d t + C)(6 + q6i) + —e~ kz cc 
ek 



We separate Q into two equations, one for each one of our unknowns 

2A 



k[z+8-]\ _ 


0, 


Oo(T,y) = 


o, 


0i(T,y) = 


s . 




■01-s) 


ek 




h-s]) = 


o, 


] o{T,y) = 


o, 


>i(T,y) = 


s . 



(d t +C)0 o + ^fe- kz cosh (k\6 1 -si) = 0. 
ek 

0o(T,y) = 0, 



(8) 



and 

{d t + c)e x = o, 

0i(T,y) = s. 

Let us define 

A(t,y) :=Et, y [S(T)]-s, 
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which measures the difference between the expected value of the mid-price at maturity and its current 
value. With this notation, and using the Feynman-Kac formula twice, first for 9\ and then for 9\, we find 



Oi(t,v) 
&o(t,y) 



A(*,j/). 



-E 



-KZ 



K 



cosh(ifA) d£ 



where all capital letters inside the integral are evaluated at £, i.e. K = K(£), A = A(£, Y(£)), etc. This 
leads to explicit expressions for the (unique) solution and its controls: 



u(t,y,q,x) 

Whold 



Uhoid(i, V, q, x) + u mm (t, y) , 
x + q(s + A) , 



(9) 



-E 



t,y 



e = 

if>* = 
r* = 



±A. 



e 
1 

k 
2 

fc' 

s + A 



.4 



-KZ 



cosh(KA) d£ 



2.5 Remarks 



Let us explain the decomposition of the solution u given in (|9| in terms of Mhoid and tt mm . 
On the one hand, the function Whoid is the expectation at expiry T of the current portfolio x + qs; this 
corresponds to a buy-and-hold strategy. On the other hand, the function u mm (as the integral from t 
to T suggests) is the profit for playing a dynamic (i.e. high-frequency) market-making strategy. Since 
i*mm > 0, the addition of the market-making mechanism it mm to the strategy u is more profitable than 
the buy-and-hold strategy u^oid alone. In consequence, it makes sense to play market-maker dynamically 
instead of simply apply a buy-and-hold strategy. 

Since in principe T can be very big, in order to compare strategies we have to compute the value 
function per time unit, i.e. u(t, y, q, x)/(T — t). Using the integral version of the mean-value theorem, it 
follows that there exists in r £ (t, T) such that 



u(r,y,q,x) _ x + q(s + A) 2 



T-t 



T -t 



.4 



-KZ 



cosh(iYA) 



In particular, when t = we have q — 0, x — and A = 0. 
expected gain of the market-maker is 



Therefore, at the beginning of the day the 



tt(0,S/,0,0) _ 2^ 
T ~ e T ' y 



A 



-KZ 



cosh(iCA) 



2 

> -Er, 

e 



A 



-KZ 



The first observation is that the worst mid-price dynamic is the martingale. Indeed, £ i-> cosh(£) 
has a strict minimum when £ = 0, and a non-martigale process has times r where A(r) 7^ 0. This seems 
counter-intuitive because, the market-making being lightning fast, it should not be influenced by the 
long-range behaviour of the mid-price. However, the inventory turnover is slower because it takes several 
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trades to build up and come back to zero, and that is where the directional bet cosh(i"T A) enters into the 
game. 

Another feature is the effect of a non-constant decay rate K. Notice that the optimal spread i/j* 
is decreasing in k and the value function u is decreasing in K. If the intensity of order arrival increases, 
which can be interpreted as cither less market orders or a more populated limit order book (LOB), then 
the market-maker has to reduce her spread to keep her order flow constant. But this makes her PNL 
smaller because she is selling liquidity cheaper. 

The effect of the market spread Z is very interesting. On the one hand, z does not appear at 
all in the optimal controls, which means that the market-making strategy is independent of z. On the 
other hand, the value function u is decreasing in Z. In consequence, the PNL of the strategy decreases 
as the spread increases. This is a direct consequence of the hypothesis that the intensity of the arrival of 
market orders depends on the distance to the other side of the book, not on the distance to the mid-price. 
Indeed, the bigger the spread, the less market flow captured by the market-maker, even if her position 
relative to the mid-price does not change. 

We have put the intensity A into the expectation operator. This is because our formulation allows a 
stochastic intensity A(t). In fact, our formulation allows as well asymmetric intensities A ± (t) and 
decays K ± (t); the only difference is that the symmetry via cosh(-) is lost and we would have 2 different, 
complicated exponentials instead. 



3 Linear utility function with inventory penalty 

3.1 Two inventory penalties 

We will penalise the inventory in two ways: 

1. A penalty ni at expiry, depending on the spread. This models the fact that the market-maker will 
have to clear her inventory at the market, and as such she will pay the spread for each share: 

Il 1 (T)=r ] Z(T)Q 2 (T) 1 ri>Q. 

For example, if z = 1 we recover the Stoll model for a quadratic penalty on the inventory (see 

2. An integral penalty II2 of the (squared) inventory during the remaining of the trading session, 
weighted by the volatility £(i) of the mid-price S(t). This is a sort of tracking error with respect 
to a flat-inventory position, and is a very standard choice (see e.g. Guilbaud-Pham jS] and Cartea- 
Jaimungal [1]): 

n 2 {T) = v J £ 2 (£)Q 2 (£R, ^>o. 

In this section, we will consider the following value function: 

u(t,y,q,x) = maxE t ^ yq Jx(T) + Q(T)S{T)-eU(T)], U:=U 1 +U 2 . (10) 

Notice that if e — we recover the previous linear case. Concerning the parameters, the important one 
is e since it gives the penalty as a perturbation of the value function we already computed. The other 
parameters 77 and v can be considered as booleans, so that we can assess a porteriori the effect of the 
two penalties on the resulting controls. 
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3.2 The HJB equation and the ansatz 

The resulting HJB equation is thus 

(d t +C)u+ max Ae~ k[z+5+] [u(t, y,q-l,x+ (s + 6+)) - u(t, y, q, x)] 



8+eA 



(11) 



+ maxie k[z+s 1 \u(t,y,q + l,x - (s - S )) - u(t,y,q,x)] 

u(T,y,q,x) 



2 2 

evo q" . 



x + sq — erjzq 



Recall that we have an explicit formula for the (unique) solution when e = 0. Therefore, we will use 
perturbation methods on s. More precisely, we propose the following ansatz: 



u(t, y,q,x) = x + (t, y) + ev™ (t, y) + O (e 2 ) , 
3.3 Verification equation and its linearisation 



With the ansatz (12), the optimal controls take the form 



1 



6- = ~ k +s + v^ 



(o) , 

Vq+l + £ 



„(!) _«W 



+ 0(e 2 ) 



Under these conditions, the verification equation is 



(dt + C) («(' 



(o) 



(1) 



fc2 e fc;5 exp 

ek 



e fc2 e~ fes exp 



A 

ek 



2 2 

evo q 
sq, 

—srjzq 2 



,(D 



Observe that we can rewrite the jump term in 5j as 



A 

ek 



-kz ks „—k9) 



exp |— fee 



0( £ 2 )} 



In consequence, its first-order expansion in e is 

.4 



ek 



-kz e -k[e^- s ] | x _ fc£ 



Analogously, for the term in 6* we have 



.4 

i 

ek 



-fc« e *[«i 0) -.] x (i_ fc£ 



y(l) 
9 



0(£ : 



(12) 



(13) 



0(e 2 )} (14) 
+ 0(e 2 )} 
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Therefore, it follows that the linearisation in e of the verification equation (14) is 

A 



v^(T,y) = sq, 
v<p(T,y) = -eW 



|l - ke 
ek I 



ek 
A 



„(i) _„« 

U q U q-1 



V q V q+1 



(15) 



9 1 

= evo ■ q , 



3.4 Solution for e° 



At zero-th order in e, which is equivalent to set e ~ 0, we recover the verification equation of the linear 
case without inventory penalty (18]), i.e. 



(d t + C) (C + qe[° } ) + ~e~** cosh (k[9^ 



0, 



ef\T, v ) = o. 

6 { ?\T,y) = s. 



Therefore, v q °\t,y) is exactly the solution we found before, i.e. 



C(t,y)+ q 8?>(t,y), 
s + A, 



3(0)/ 



(16) 



i°\t,y) = \, y 



A 



KZ 



cosh(i^A) d£ 



where (as before) A := E t , y [S(T)] — s. This fact is our main motivation for the application of perturbation 
methods on the inventory constraints. 

3.5 Solution for e l 



Keeping only the first-order terms in (15), i.e. those with e, we obtain 
(dt + C) (0« 



? (D 





e 1 


-9[ 1} + (1 - 2q)8 { 2 1 




e 1 


e[ 1) + (i + 2 q )0i 1) } 




2 2 






= vo q , 






= o, 






- o, 






= — rjz . 





(17) 
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We decompose (17) into three equations, and as before we solve one by one via the Feynmann-Kac 
formula. The first equation, which regroups the terms in q 2 , is linear: 

{d t +C)9^ = vo\ 



5(1) 



(T, y) = -rjz . 



Its (unique) solution is 



0'£\t,y) = -rjE t , y [Z{T)]-vE- 



(18) 



Since 9^ and 9^ are already known, the equation for the terms q 1 becomes linear in 9^\ i.e 



3(1) 



(i) 



{d t + C) 9[ 1} + — e - kz sinh(fcA)^ 1) 



Therefore, its (unique) solution is 



Finally, for q° we get 



9^(t,y) = -E t , y 



i 1] (T,y) = 0. 



J Ae~ KZ sinh(KA)9 { 2 1) d£_ 



(19) 



e -k, e -kA 



e' kz e +kA 



whose (unique) solution is 



Ae~ KZ ^9[ 1] sinh(fcA) + 9^ cosh(fcA)} d£ 



(20) 



Putting all together (12), (16), (18), (19), (20) we finally obtain the explicit expansion of u(t,y,q,x) 
up to first order in e. 



3.6 Optimal controls 

Now we are in measure to write down explicitly the optimal controls up to first-order in e. From the 
ansatz ( 12 ) and the control equations ( 13 1 it follows that 



s(9[ 1 >-(l-2q)9^)+0(s 



(i) 



(21) 



6- = 



3 (0) 



(i) 
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\-2e0^ + O(e 2 ) 



This implies that the optimal spread for the market-maker is 

ip* = st + s- 

whilst the centre of her spread is 

r, = s + \ (St 8-) = 0<°> + e{e^ + 2<0< 1 >) + O (e 2 ) . 



(22) 



(23) 



In the light of all the former computations, we can now write down in full splendour the optimal controls 
for the market-maker: 



St = *+A + J-E t , 



S* = £ - A + e I E tiV 



1; 



+ 2en + (e 2 ) , 



s + A - e { E t , y 



- (1 + 2q)% \ + O (e 2 ) , 



(24) 



+ 2q% \ + O (e 2 ) 



where A := E tyV [S(T)] — s is the degree of non-martingality of the mid-price (i.e. the directional bet), 

H := -Ae~ KZ smh(KA) 
e 

is the (marginal) profit of the market-making as a fuction of the directional bet A, which is a function of 
A, and 

ir:=rjEt,y[Z(T)] + vEt,y S 2 <% >0 
is the unitary inventory-risk penalty. 
3.7 Remarks 

The unitary inventory-risk penalty n has two components. The term in v (the integral one) penalises 
a non-flat inventory via the volatility thoughout the day. The term in 77 is a penalty that triggers only 
near the end of the day, but strong enough to force the market-maker to leave the trading floor with zero 
inventory. Both penalties are complementary: one keeps the inventory within range during the trading 
day whilst the other forces the inventory to finish the day near zero. 

Notice that the optimal spread V'* is increasing in the unitary inventory-risk penalty n. 
This feature can be understood in a very intuitive way: if a market-maker is more sensitive to an inventory 
risk then she will be more conservative in her quotes, fearing that even a small price jump would put her 
on the wrong side of the trend. Her inventory-risk aversion thus translates into a wider spread. However, 
the inventory q does not appear at all in the expression for ^>*. Indeed, it is the unitary inventory-risk 
aversion which determines the width of the spread, not the current inventory level. 
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Observe that the center of the spread is decreasing in the inventory q. This is also 
very intuitive: the market-maker will tilt her quotes, rendering them asymmetrical, in order to favour 
execution vs incoming market orders which help her reduce her (absolute) inventory. For example, if she 
is long inventory (q > 0) then she will post aggressive bid quotes to lure selling market orders. At the 
same time, her ask quotes are more conservative because she does not want to increase her inventory via 
buying market orders. If q < then the market-maker will post conservative sell orders and aggressive 
buy orders, hoping to reduce her inventory via an asymmetrical market-flow. 

The directional bet A is present in the centre r*. For example, if A > then the market- 
maker expects a final mid-price higher than the current one. Therefore, if she is willing to carry some 
inventory-risk, she will post aggressive bid quotes and less aggressive ask quotes. As a result, in average 
the inventory will be positive, reflecting the long bet in the mid-price. As we can see, the dynamic of 
the centre is governed by two opposite effects: A tries to build up the inventory to profit from the 
difference between the current mid-price and the estimate of the final mid-price, whilst the term in s 
aims to keep a flat-inventory position. 

If the midprice is a martingale then A = and = s — 2qeii + 0(e 2 ), i.e. we get rid of the 
integral term in sinh(-). This implies that the integral term in sinh(-) measures the profit of the 
market-making strategy with respect to the directional bet A^O. Now assume that in the non- 
martingale case we want more simplicity on the formulas, namely we totally discard the integral terms 
in sinh(-) and in the volatility E 2 ; in fact, this is equivalent of a zero-th order expansion in T — t of the 
optimal controls. It turns out that we recover the optimal quotes in Fodra and Labadie [6], which were 
obtained by linearising the verification equation without much care on the accuracy of the approximation. 
This means that the integral terms, which are are a path-dependent, offer a correction based on the time 
to maturity T — t and the degree of non-martingality A. 



3.8 The effect of transaction costs 

Suppose that the market-maker pays a fixed fee of a for each traded asset. In most venues a > 0, but 
there are some trading platforms where a liquidity provider receives a rebate, i.e. a < 0. Since the 
transaction cost a affects the cash process each time a share is traded, we have to modify ([!]) as 

{d t +C)u+ max A e - k[z+s+] \u(t, y, q - 1, x + (s + S + ) - a) - u(t, y, q, x)] (25) 

S+GA 

+ max Ae _fe[z+<5 1 \u(t, y. q + 1. x - (s - 8~) - a) - u(t,y, q,x)] = eva 2 q 2 , 



2 



u(T,y,q,x) = x + sq-erjzq 



Let be the optimal controls without transaction costs, and $£*(z) the optimal controls under 
transaction costs. If we define 



H(a) := -Ae- K(z+a) smh(iTA) 
e 
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then repeating the previous computations we obtain 



V'a* 



a + A - 



-E 



t,y 



+ a- A + e<E t . 



H(a)M£ 



+ (1 - 2 9 )tt| + O (e 2 ) 
-(l + 2q)n\+0(e 2 ) , 



(26) 



- + 2a + 2en + O (e 2 ) , 











J H(a)nd£_ 


+ 2 9 7f| 



As we can see, the transaction cost a widens the market-maker's spread by 2a, i.e. 

ip a * — i>* + 2a . 

If we recall that each time the market-maker trades the spread she pays exactly 2a, we conclude that her 
gain per traded spread is constant with value ip*, regardless of the transaction cost a. This feature has 
two immediate consequences: 

• If a > then the market-maker tries to compensate her loss in transaction costs by widening her 
spread by 2a. By acting this way, she keeps her gain per traded spread constant but accepts a 
smaller probability of execution since she is moving her quotes further into the LOB. If a is too 
big then her spread could be so wide and far from the best quotes that the market-maker will not 
capture any market flow at all. Now imagine a scenario where most market-makers have the same 
behaviour: if they all have wider spreads then this would in aggregate increase the market spread 
2z. In consequence, the liquidity would diminish since the spread gets bigger and thus market 
orders will get executed at a price further from the mid-price. 

• In the presence of a rebate (i.e. a < 0) the market-maker systematically reduces her optimal 
spread. More precisely, she uses the rebate to improve her quotes, keeping her gain per traded 
spread constant but increasing her probability of execution by placing bets closer to the mid-price. 
If a is negative enough to have ^/> a * = then the market-maker could even accept to buy and sell 
at the same price, earning no profit in the operation except for the rebate. This strategy is called 
scalping or rebate arbitrage, and has been commented by several practitioners (see e.g. Bodek and 
Shaw 0). 

From the previous observations, it follows that the fee structure in high-frequency trading is crucial 
for the liquidity offer (measured as both the size of the spread and the available volume in the LOB) 
and the profitability of high-frequency market-making strategies. Moreover, trading venues with a rebate 
mechanism have smaller spreads, are more profitable for market-makers and lure non-market-making 
arbitrage strategies. These findings, although they are based on our model and its hypotheses, are 
consistent with several other theoretical and empirical studies on high-frequency trading. 

For example, Colliard and Hoffmann [5] found that the French financial transaction tax (FTT) imposed 
on August 2012 has reduced the number of aggressive limit orders, i.e. those that reduce the spread. The 
FTT also reduced the depth of the market, i.e. the sum of available volume at best bid and best ask, as 
well as the trading volume. Finally, the market spread and the volatility were unaffected. 
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Comparing our results with those of Colliard and Hoffman, one could argue that the loss of volume 
at the best quotes and inside the market spread was due to traders increasing their spreads as a response 
to an increase of a. That said, it is worth to mention that FTT did not apply to market-makers who are 
members of the exchanges. In consequence, these fee-exempted market-makers continued offering liquidity 
at the best quotes, which explains why the market spread did not change with the FTT implementation. 

4 Numerical Simulations 

We performed several numerical simulation in order to assess quantitatively the statistical properties 
of the market-making strategy and its parameters, esp. the mid-price pattern (martingale or mean- 
reverting), the inventory risk e and the transaction costs a. All simulations were performed in R. 

The mid-price in all cases will be an Ornstein-Uhlenbeck process, i.e. 

dS(t) = a[S(t) - im] + adW{t) , 

which satisfies 

E M [5(i)] = .se- a ( T - t }+ A1 (l-e- a ( T - t )) 
A = ( M - S )(l-e-°( T - 4 )) . 

Recall that for a martingale (e.g. Brownian motion) we have Et iS [£(£)] = s and A = 0. 

The parameters we use, unless explicitly stated in the figures, are 10,000 simulations, 1000 events per 
day, T = 1, A = 1000, k = 1, initial price 5(0) = 3000, a = 0.1, /z = 3009 (i.e. +3% of initial price), 
a = 0.5, z = 0.5, e = 0.001, t) = 1, v = 1, a = 0.05. 

In Figure[2]we represent the martingale strategy, whose controls Sf 1 are rather constant, around ±1.0. 
They small fluctuations are due to inventory, but are independent of the mid-price. When the time 
t < 200 the quotes are tilted downwards: the market-maker has a positive inventory, thus she improves 
her ask price in order to sell. At t = 200 there is a jump upwards on the quotes, following a drop in 
the inventory. At the end of the day there is less fluctuation on the quotes: the inventory penalty due 
to the volatility is near zero, and the penalty at expiry does not trigger because the inventory is rather flat. 

In Figure [3] the quotes have stronger fluctuations. Near t = the quotes are downward, reflecting the 
fact that the mid-price is below its mean fi (red-dotted line) and the market-maker bets the price will go 
up. Around t = 50 the price crosses fj, and sits above /i: the market-maker bets for prices to come back 
later, hence she gets aggresive selling quotes and builds up a negative inventory. At t = 200, when the 
price comes back to /i, she starts to unwind some of her position. At the end of the day the mid-price 
plunges below fi, but there market-maker does not follow the buying signal: the dynamic is no longer 
driven by a directional bet but rather on clearing her inventory, which explains the steady slope in her 
inventory towards zero instead of a peak upward. 

In Figure [4] we plotted the histogram of the maximum and minimum intraday inventory. On the left- 
hand side we have the martingale strategy, which is rather symmetric: the strategy has no directional 
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Figure 3: A trading day with Ornstein-Uhlenbeck (i.e. mean reverting) controls. 
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Figure 4: Min-Max intraday inventory. 
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Figure 5: Comparison of directional patterns: martingale (M) vs mean-reverting (OU). 
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PNL densities vs epsilon 
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Figure 6: The effect of the inventory risk e on the mean-reverting strategy. 
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bet, although the mid-price has an upward trend. On the right-hand side we have the mean-reverting 
strategy, whose histogram is skewed to the right. The market-maker tends to be positive in inventory 
because she bets the price will go up. The inventory risk is much bigger for Ornstcin-Uhlenbeck than 
for the martingale: the inventory of the latter is around [—60, 60] whilst that of the former is around 
[-400,500]. 

In Figure [5] we compare the effect of the directional bet on the PNL density. On the one hand, the 
mean-reverting strategy earns more than the martingale one: it has higher mean, median and higher 
quantiles. This happens because the market-maker knows the pattern and the parameters of the mid- 
price, and exploits this information. On the other hand, the mean-reverting strategy is more risky than 
the martingale one, as it can be inferred from the lower quantiles and the higher-order moments (standard 
deviation, skewness, kurtosis). In consequence, a direcional bet makes the strategy more profitable but 
more risky. 

In Figure [6] we show the effect of the inventory-risk parameter e on the PNL density. It is clear 
that as s increases, the PNL density has thinner tails and is more concentrated, which corresponds to a 
substantial risk reduction, as we can see in the quantiles and the higher-order moments. Although there 
is a decrease in the average daily gain, the (daily) Sharpe ratio is increasing in e. 

In Figure [7] we see the effect of the transaction costs a on the PNL density. There is a significant 
reduction on both the average daily gain and the (daily) Sharpe ratio, which implies the market-making 
becomes less profitable in the presence of higher transaction costs. For the other statistics, there is no 
significant change. One could thus be tempted thus to think that the effect of a on the PNL density is 
just a shift towards the left, but that is not exact because the shift is visible on the right tail and the 
head but it is negligeable on the left tail. In consequence, the transaction costs reduce the performance 
but they do not reduce the overall risk profile. 

5 Mult i- asset model 

Assume now that we have M different assets or mid-prices Si, ... , Sm- Our approach allows for general 
Markov processes, but for simplicity (and without loss of generality) let us suppose that S(t) is an 
M-dimensional Ito diffusion 

dS(t) = a(t, S)dt + E(i, S)dW(t) , 

where a(t, S) <G M M is the drift and dW(t) is an M-dimensional standard Brownian motion. The M x M 
matrix £(t, S) is the square root of the (symmetric, positive-defined) variance-covariance matrix A, i.e. 
A = £'£. The inventory Q(t) is driven by the M-dimensional couple (N + (t), N~ (t)), where the i th 
component (N^~ (t) , N~ (t)) is a Cox (or inhomogeneous Poisson) process having controlled intensity: 

(W), W)) : = (A i e- k ^ +s t\A i e- k ^ +s ^) (27) 
In summary, the SDEs that govern the dynamics of the processes are 

dX(t) = [S(t) + 5+(t)}-dN + (t)-[S(t) + 5+(t)}-dN-(t), (28) 
dS(t) = a(t,S)dt + E(t,S)dW(t), 
dQ(t) = -dN + (t)+dN-(t), 
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where X(t) e K, Q(t) G Z M and S(t) e K M . 

5.1 Inventory penalties 

As before, we have two penalties. 

1. A penalty at expiry, 

III := vQ'(T)Q{T)Q(T) , ry>0, 

where ft is an M x M symmetric, positive-definite matrix depending on some stochastic processes. 
For example, if the penalty is the cost of passing market orders at the end of the trading day then 
then Slij — SijZi, i.e. ft is diagonal and coincides with the vector spread. However, f2 could be any 
matrix that takes into account the cost of executing the M-dimensional portfolio Q(T) at market, 
including (but not limited to) the variance-covariance A = S'E. 



2. An integral (thus path-dependent) penalty 

n 2 -.= uJ t T Q'(OHOQ(0^. 



For each asset Si we have two controls 5 i in an admissible space Ai- this means that the vector 
controls S^ 1 = (Sf, . . . , 8^) lie in the admissible space A = A\ x • • • x A m . Under this framework, The 
value function is thus: 

u(t,y,q,x)= m&xE t . v , q Jx(T) + Q(T)-S(T)-eII(T)] , U:=U 1 +U 2 , (29) 

5.2 The HJB equation and the ansatz 

The HJB equation is 

(dt + C) u + max P[jump] \u(t, y + , q + , x + ) — u(t, y, q, x)] = evq'Aq, (30) 

5±eA 

u(T, y,q,x) = x + s ■ q- enq'Vtq , 

where P[jump] is the probability of having a jump (i.e. market orders hitting the market-maker's quotes) 
at time t and u(t 1 y + ,q + ,x + ) is the value function after the jump. As it is customary with several 
independent Poisson jumps, we will assume that they cannot occur simultaneously. This means that the 
jump 

u(t,y + ,q + ,x+) - u(t,y,q,x) 

can be decomposed as the sum of the jumps over i. More precisely, denote qi the vector q without its 
z-th coordinate, i.e modulo index permutation we have q — (%,<?»). Then the jumps are 

M 

u(t + , y + , <7 + , x + ) - u(t, y, q,x) = [u(t, y,q i ,q l -l,x + (s, + 5+)) - u(t, y, q, x) 

i=l 
M 

i=l 
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In consequence, the HJB equation (30 1 takes the form 



A I 



M 



(d t + + V max Ae ] u(t,y, q u q* - 1, x + (a,- + 5f)) - u(t,y, q, x) 

< * £+ s- A 



(31) 



max Ae- k ^ +s - 1 



u(t, y, q t , q l + l.x- (sj - 8 i )) - u(t, y, q, x) 
u{T,y,q,x) = x + s-q 



evq'Aq , 
er/q'ilq . 



We will again use perturbation methods on e and search for a solution of (31) with the following 
shape: 



i(t,y,q,x) = x + v^{t,y)+ev^{t,y) + 0(e 2 ) 

M 

vW(t,y) = ^, y ) + 5>^ (0) (i,y), 



(=i 

M M M 

i=l i=l j=l 



(32) 



where 



(i) = 

ij ji ' 



5.3 Verification equation and its linearisation 



With the ansatz (32), the marginal jumps can be readily computed: 



9,-1 U qi ~ °i 
(°) _„(0) _ fl(0) 



A/ 



M 



j'=i 



(33) 



Therefore, since the marginal optimal controls are 



/)+ - — 4- S- +7I (0) -7I (0) + 



w (l) _ „(!) 



w (l) 



+ 0(e 2 ) , 
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it follows that 



St 



(o) 



i 



■s-d^+e 



A/ 



n(i) 



2^6, 



(i) _ fl(i) 

ij ii 



i=i 



A/ 



+ 0(e 2 ) 



(34) 



i=i 



•O £ 2 



Under these conditions, the verification equation is 



A I 



.4 



£ ±L e -*i* e k«., e -Mj u 'exp^-fc l£ 



Af 



eft 
.4 



□(i) 



A/ 



+ E 

i=l 

= evq'Aq , 
vf\T,y) = s-q, 
vW(T,y) = - m 'nq. 



« e -M. e -*..« e Wi 



exp < fcjg 



n(i) 



*E< 

M 

2 E* 

i=i 



The linearisation in e of the verification equation ( 35 ) is 



M 

E 

i=l 
M 

E 

i=l 

evq'Aq , 
s-q, 
—r/q'tlq . 



iZi e kiSi e ki 



1 - 



f + fc;£ 



M 



3 (1) 



a(i) 



A/ 



2E^? 

i'=l 



2E^? 

i=i 



3 (l) 



3 (l) 



+ 0(e 2 ) 

(.8.-,.) 



0(e 2 ) 
0(e 2 ) 



(36) 



5.4 Solution for £° 

At zero-th order in £ we get 



A/ 



(9 t + C) «f + E ^" fc ^ cosh (fo^f - Si ] 



< } (r,y) 



0. 



(37) 
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We decompose (37 1 in its M + 1 components. For ^g -* we obtain 

M 



(a t + £)C+E 



2A, 



cosh ( ki [9 



3(0) 



t\T,y) 



= 0, 
- 0, 



whilst for we get 

(d t + c) ef 

ef\T,y) 

Using Feynman-Kac (recursively) we obtain 



3 (0) 



q(0) 



(t,y) 



Si + Ai 

M _ 



(t,y) = Y,Z E 



-KiZt 



cosh(if i A l ) di 



where Aj := E til , [Si(T)] — Sj. As in the single-asset case, the solution 



T A 
K 



* e -K iZi 



cosh^A,) d£ 



(38) 



(39) 



(40) 



M M ^ 

u(t,y,q,x) = x +y^gj(sj + A,) + -E ti!/ 
i=l i=l 6 

is exact for e = 0, i.e. for the problem without inventory constraints. As before, the first two terms of 
the right-hand side are the buy-and-hold strategy, i.e. the expected gain of keeping the current inventory 
position until the end of the trading day. The third term takes into account the market-making thoughout 
the day (that is why it has an integral term) with a directional bet (given by the Aj's). Since the third 
term is strictly positive then it is more profitable to play market-making than buy-and-hold only, and it 
works better for non-martingale mid-price processes. 



5.5 Solution for e l 

At first order in e we have 



A, 



M 

— e * 'e 
e 

t=i 



1j 



Ai 



M 

— e " *e ' 
e 

i=l 

vq'Aq , 
—rjq'Q.q . 



M 

3=1 
M 

J=l 



3(1) nW 



3(1) 



We decompose (41 1 into several components. For 6^ we obtain 



M 



i=l 



2Jk 

eki 



j — kiZi 



sinh (hAA 0f ] + V ^ e - *** cosh {hAA 6 

' J pic - 



(1) 



1=1 



eki 



i'H^y) 



(41) 



(42) 
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for 6*-^ we have 



AA 4 



M 



6 3=1 



(T,y) = 0; 



and for 0y we get 



(d t + C)9^ = vAij , 



Solving (recursively) with Feynmann-Kac yields 



■ 4 E 



/.y 



M 



/ A ie -^^ sinh( J K' i A i ) J] fry d£ 
^ ^e"*' 2 * {flf > sinh(X i A i ) - fr„- cosh^A,)} d£ 



where 



fry := ?y]E tiy [fty(T)] + i/E t)1 



(43) 



(44) 



(45) 



Putting all together (32), (401, (45) we obtain the explicit expansion of u(t,y,q,x) up to first order in e. 



5.6 Optimal controls 



Plugging the explicit expressions (40), (45) into (34) yields 



St = 



A, + e { — Et ti 



/ Aie~ KiZi sinh^A,) V fry d£ 
Jt j=l 



M 



2^2q j Tt ij +nu f +0(e 2 

3=1 



(46) 



ST, = --A l+ e{+-E tty 



r T M M 

/ Aie~ KiZi sinh(ifiAj) ^ fry d£ I + 2 V ^ffy + fr. 

Jt 3 = 1 3 = 1 



O e 2 



Let us lighten the notation via vectors. If we define 





' si - 




- l/h ' 
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" 1 " 








_ l/k M _ 
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. A m . 


, 1 = 


_ 1 _ 
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TTll 



and 



e 



7TM1 ' ' ' 7TMM 

A ie " KlZl sinh( J ftT 1 A 1 ) 




Diag(^) 



Til 



TJWM 



a/ 



then the optimal controls are 













= fc" 1 + A + e< 
















= fc~ 1 -A + £< 




J HttI d£, 











27rg + Diag(7r)| + O (e 2 ) 
2^g + Diag(^) I + O (e 2 ) 



(47) 



V>* = 2A;- 1 + 2eDiag(^) + O (e 2 ) 













+ 27rg| 



5.7 The effect of transaction costs 



For each asset S% we can add a transaction cost ctj. In order to use the vectorial notation as in (47 1, 
define 



olm 



and 



H{o) = 







^ Me --ffM(2M+a Af ) sinh(ii'MAM) 

It can be shown that the optimal controls under transaction costs are 

l-T 



<5£» = k- 1 + a + A + e { -E t , 



6-„ = k- 1 + a - A + e { +E t , 



H(a)nl d£ 



t/> q * = 2AT 1 + 2a + 2eDiag(^) + O (e 2 ) 

rT 



H(a)nl d£ 

2 



-2frg + Diag(7r)| + O (e 2 ) 
+ 2^g + Diag(^) I + O (e 2 ) 



(48) 



s + A - e i E(, 



fl"(a)7fld$ 



2* 9 } + O (e 2 
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Observe that the optimal market-maker's spread ip a * do not have crossed terms, i.e. the spreads are 
independent. The optimal centres r aie do have cross-correlation terms involved via ff, which enter into 
play twice: first, as weights on the directional bet H(a), integrated over the remaining time T — t; second, 
as weights on the inventory q. 

In the particular case where all mid-prices are martingales, i.e. A = and H(a) — 0, we have 

tp m = 2fc- 1 + 2a + 2eDiag(^) + 0(e 2 ) , (49) 
r aif = a - 2enq + O (e 2 ) . 



5.8 Risk management and iso-risk surfaces 

Recall that the inventory risk is given by H(q) = e[IT 1 (g) + n 2 (<?)], where 

i-T 



n 1 (q)= V q / nq, n 2 (q):=uj jAqd£ 



with both fl and A are M x M symmetric, positive-definite matrices. The (M — l)-dimcnsional iso-risk 
surface at level c > is defined as 

isoRisk(c) = {w e R M : Ii(w) = c) . 

Observe that q'tlq and q'Aq are positive-definite bilinear forms in M M , hence they define two norms 
equivalent to the Euclidean one. Moreover, their corresponding level surfaces are ellipsoids. This implies 
that the iso-risk surface isoRisk(c) is determined by the level curves of IT! and n 2 , i.e. by the norms of 
the inventory vector q induced by Q, and A. 

In order to assess the effect of cross-correlation in the risk measure, let us consider the simplest 
example, i.e. M = 2, e = 1, ?? = !, v = and 



Q - 

In this case, the iso-risk curves are ellipses, i.e. 



1 P 
P 1 



, pe[o,i). 



isoRisk(c) = {w — {w\, W2)' € K 2 : II(uj) = c} , H(w) = w\ + 2pwiw 2 + w\ . 

Now, assume that the market- maker looks for an inventory q = (qi,q 2 )' such that \qi \ + \qi \ = 2. Then 
the possible configurations (modulo the symmetry q\ o q 2 ) are 

q A = (2,0)', q B -(1,1)' and q c = {l,-l)'. 

Since 

n(g A )=4, U(q B ) = 2(1 + p), U(q c ) = 2(1 - p) , 

if p € (0, 1) then 

U(q c ) < U(q B ) < n(q A ) , 
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i.e. all three configurations belong to a different iso-risk ellipse and qc is the unique configuration min- 
imising the risk. Therefore, the market-maker will privilege qc over qA and qB- 

Imagine that the current inventory of the market-maker is (2, —1)', i.e. an absolute inventory of 3. If 
she desires to reduce her absolute inventory to 2 then she will choose (1, —1)' over (2, 0)'. However, it is 
not always desirable to reduce the absolute inventory: if p > 1/4 then 

n((2,-l)') = 5-4p<4 = n((2,0) / ), 

which implies that the market-maker would prefer (2, —1)' over (2, 0)', even if in terms of absolute inven- 
tory this choice seems counter-intuitive. 

In the general case, the philosophy is the same: the risks in inventory are not measured in absolute 
terms, but in terms of the bilinear forms q'Qq and q'Aq and how they affect the iso-risk surface isoRisk(c). 
Therefore, in the presence of non-zero correlations the market-maker will favour a well-diversified port- 
folio: the optimal inventory vector q is computed following the same optimisation as for the Markowitz 
Portfolio and the Efficient Frontier. 
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